feat(k6): add support for k8s loading/unloading of Seldon CRs #5563

lc525 · 2024-05-02T17:47:59Z

This adds initial support for k8s via the USE_KUBE_CONTROL_PLANE environment variable:

models/pipelines/experiments in components/model.js are modified to also return CR yamls
updates functions in components/utils.js (including setupBase(...) and teardownBase(...) to use xk6-kubernetes, when configured
all scenarios already using setupBase(...) should work unchanged

Which issue(s) this PR fixes:

INFRA-949 (internal issue) Extend existing k6 scenarios to use xk6-kubernetes

TODO

Test functionality in kind
Fix getting model/pipeline/experiment CRs back from k8s
Fix model/pipeline/experiment deletion

This adds initial support for k8s via the `USE_KUBE_CONTROL_PLANE` environment variable: - models/pipelines/experiments in components/model.js are modified to also return CR yamls - updates functions in components/utils.js (including `setupBase(...)` and `teardownBase(...)` to use xk6-kubernetes, when configured - all scenarios already using `setupBase(...)` should work unchanged **Which issue(s) this PR fixes:** - INFRA-949 (internal issue) Extend existing k6 scenarios to use xk6-kubernetes - [ ] Test functionality in kind - [ ] Extend existing k6 scenarios to use xk6-kubernetes

- works: model creation + load - fails: getting model status via k8s (permissions) - fails: deleting/tear down models (permissions)

lc525 · 2024-05-02T22:54:52Z

tests/k6/configs/k8s/base/k6.yaml

        - name: SCHEDULER_ENDPOINT
          value: "${SCHEDULER_ENDPOINT}:9004"
        - name: INFER_HTTP_ITERATIONS
          value: "1"
        - name: INFER_GRPC_ITERATIONS
          value: "1"
        - name: MODELNAME_PREFIX
-          value: "tfsimplea,pytorch_cifar10a,tfmnista,mlflow_winea,irisa"
+          value: "tfsimplea,pytorch-cifar10a,tfmnista,mlflow-winea,irisa"


names changed because the ones with _ were not getting loaded in k8s

lc525 · 2024-05-03T13:52:57Z

tests/k6/components/model.js

@@ -97,32 +101,32 @@ export function getModelInferencePayload(modelName, inferBatchSize) {
        const shape = [inferBatchSize, 16]
        var httpBytes = []
        var grpcBytes = []
-        
+


changes from here to line 208 are linting changes

sakoush

lovely! this is a great feature moving forward.

I guess the scenarios we care about can use USE_KUBE_CONTROL_PLANE, i.e. infer_constant_rate.js and infer_constant_vu.js.

tests/k6/Makefile

sakoush · 2024-05-03T13:52:29Z

tests/k6/components/k8s.js

@@ -0,0 +1,103 @@
+import { Kubernetes } from "k6/x/kubernetes";
+import { getConfig } from '../components/settings.js'
+import {


in the future we might want also to check the status via kube. not a blocker for this PR though.

Indeed, was already planning for that. The xk6-kubernetes extension is a bit wierd in that it doesn't have built-in functionality for waiting on a given condition/status. Also, it throws exceptions for any unexpected conditions, so I'll have to code that a bit defensively (via repeated resource gets).

sakoush · 2024-05-03T13:55:48Z

tests/k6/components/k8s.js

+}
+
+export function loadModel(modelName, data, awaitReady=true) {
+    //console.log(data)


yep, will clean those up. They were there in the scheduler code so I've left them for dev debugging

sakoush · 2024-05-03T13:57:39Z

tests/k6/components/k8s.js

+
+export function loadModel(modelName, data, awaitReady=true) {
+    //console.log(data)
+    if(!seldonObjExists(seldonObjectType.MODEL, modelName, namespace)) {


do we have to check if the model already exists?

If a model already exists here we get an exception. The exception likely happens only when the loaded model has the same CR as an existing one (it is an apply after all).

sakoush · 2024-05-03T13:58:04Z

tests/k6/components/k8s.js

+}
+
+export function unloadModel(modelName, awaitReady=true) {
+    // console.log("Unloading model "+modelName)


sakoush · 2024-05-03T13:58:14Z

tests/k6/components/k8s.js

+}
+
+export function loadPipeline(pipelineName, data, awaitReady=true) {
+    //console.log(data)


sakoush · 2024-05-03T13:59:02Z

tests/k6/components/model.js

@@ -1,3 +1,7 @@
+// import { dump as yamlDump } from "../import/js-yaml.mjs"


sakoush · 2024-05-03T13:59:46Z

tests/k6/components/model.js

        for (var i = 0; i < 16 * inferBatchSize; i++) {
            grpcBytes.push("MQ=="); // base64 of 1
            httpBytes.push("97")
        }
        const payload = {
-            "http": {"inputs":[{"name":"INPUT0","data":httpBytes,"datatype":"BYTES","shape":shape},{"name":"INPUT1","data":httpBytes,"datatype":"BYTES","shape":shape}]},
-            "grpc": {"inputs":[{"name":"INPUT0","contents":{"bytes_contents":grpcBytes},"datatype":"BYTES","shape":shape},{"name":"INPUT1","contents":{"bytes_contents":grpcBytes},"datatype":"BYTES","shape":shape}]}
+            "http": { "inputs": [{ "name": "INPUT0", "data": httpBytes, "datatype": "BYTES", "shape": shape }, { "name": "INPUT1", "data": httpBytes, "datatype": "BYTES", "shape": shape }] },


question: how did you reformat this file?

It was the automated formatter from intelliJ (we should try to find a cli formatter and add linting/formatting to the makefile)

sakoush · 2024-05-03T14:02:06Z

tests/k6/components/model.js

+            "storageUri": uri,
+            "requirements": modelTemplate.requirements,
+            "memory": (memoryBytes == null) ? modelTemplate.memoryBytes : memoryBytes,
+            "minReplicas": 1,


not sure if we need minReplicas, it will trigger I think autoscaling of models which we might want to do.

maybe at least align with the grpc api path.

sakoush · 2024-05-03T14:02:31Z

tests/k6/components/seldon.js

@@ -0,0 +1,5 @@
+export const seldonObjectType = {


- cleanup commented debug calls - remove minReplicas from CRs

lc525 added the v2 label May 2, 2024

fix disconnectScheduler function prototype

05ab4d8

lc525 changed the title ~~feat(k6): add support for k8s loading/unloading of Seldon CRs~~ feat(k6): add k6 support for k8s loading/unloading of Seldon CRs May 2, 2024

lc525 changed the title ~~feat(k6): add k6 support for k8s loading/unloading of Seldon CRs~~ feat(k6): add support for k8s loading/unloading of Seldon CRs May 2, 2024

fixes after testing in kind

2211aa1

- works: model creation + load - fails: getting model status via k8s (permissions) - fails: deleting/tear down models (permissions)

lc525 commented May 2, 2024

View reviewed changes

fix getting k8s CRs and deleting them

b505b7e

lc525 marked this pull request as ready for review May 3, 2024 13:50

lc525 requested a review from sakoush as a code owner May 3, 2024 13:50

lc525 commented May 3, 2024

View reviewed changes

sakoush approved these changes May 3, 2024

View reviewed changes

lc525 added 2 commits May 7, 2024 09:42

fixes following review comments

bb5fd35

- cleanup commented debug calls - remove minReplicas from CRs

add note on apply not picking up CR changes

a49a16c

lc525 merged commit 5ea6714 into SeldonIO:v2 May 7, 2024
3 checks passed

lc525 deleted the INFRA-949/k6-use-k8s branch May 7, 2024 13:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(k6): add support for k8s loading/unloading of Seldon CRs #5563

feat(k6): add support for k8s loading/unloading of Seldon CRs #5563

lc525 commented May 2, 2024 •

edited

Loading

lc525 May 2, 2024

lc525 May 3, 2024

sakoush left a comment

sakoush May 3, 2024

lc525 May 7, 2024

sakoush May 3, 2024

lc525 May 7, 2024

sakoush May 3, 2024

lc525 May 7, 2024

sakoush May 3, 2024

sakoush May 3, 2024

sakoush May 3, 2024

sakoush May 3, 2024

lc525 May 7, 2024

sakoush May 3, 2024

sakoush May 3, 2024

		@@ -1,3 +1,7 @@
		// import { dump as yamlDump } from "../import/js-yaml.mjs"

feat(k6): add support for k8s loading/unloading of Seldon CRs #5563

feat(k6): add support for k8s loading/unloading of Seldon CRs #5563

Conversation

lc525 commented May 2, 2024 • edited Loading

TODO

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sakoush left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lc525 commented May 2, 2024 •

edited

Loading